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Abstract 

We present a polynomial time dynamic programming algorithm for optimal parti- 
tions in the shortest path metric induced by a tree. This resolves, among other things, 
the exact complexity status of the optimal partition problems in one dimensional geo- 
metric metric settings. Our method of solution could be also of independent interest in 
other applications. We discuss also an extension of our method to the class of metrics 
induced by the bounded treewidth graphs. 

1 Introduction 

The optimal partition problems for unweighted and weighted graphs are classical NP-hard 
combinatorial optimization problems. 

The typical partition problems, like MAX-CUT and MAX-BISECTION are well known 
to be APX-hard [£]. Also the existence of a PTAS for MIN-BISECTION has been likely 
ruled out in [13J. The metric counterparts of these problems are to find the corresponding 
optimal partitions of the complete graph on the input point set, where edges are weighted 
by metric distances between their endpoints. The metric MAX-CUT, MAX-BISECTION, 
MIN-BISECTION and other partitioning problems were all proved to have polynomial time 
approximation schemes (PTAS) P, |8l [HI El [3 El [10]. These problems are known to be 
NP-hard in general metric setting (e.g., for 1 — 2 metrics |8]). Their exact computational 
status for geometric (i.e., Lp) metrics and this even for dimension one was widely open. 

In this paper, in particular, we resolve the complexity status of these problems for just 
dimension one by giving a polynomial time algorithm. Our solution, somewhat surprisingly, 
involves certain new ideas for applying dynamic programming which could be also of in- 
dependent interest (see [12] for a preliminary version). In fact, our dynamic programming 
method works for the more general case of tree metric spaces, where the underlying trees have 
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nonnegative real edge weights (see also the embeddability properties of arbitrary metrics in 
tree metrics Observe that the one dimensional case can be modeled by line graphs with 
nonnegative real edge weights. We also give an evidence that our polynomial time method 
can be extended to include analogous partition problems in metric spaces induced by shortest 
paths in graphs of constant treewidth. 



2 Preliminaries 

We define our dynamic programming method in terms of generalized subproblems on finite 
multisets of vertices in an undirected graph with nonnegative real edge weights. 

For a partition of a finite multiset P of graph vertices into two multisets Pi and P21 the 
value of the cut is the sum over all pairs (f , w) E Pi x P2 of length (i.e., total weight) of a 



shortest path connecting v with w. For example, see Fig. la 

The MAX-CUT problem for P will be now to find a partition of P into two multisets that 
maximizes the value of the cut. If |-P| = n and the two multisets are additionally required to 
be of cardinality k and n — k, respectively, then we obtain the {k, n — k) MAX-PARTITION 
problem for P. In particular, if n is even and k = n/2 then we have the MAX-BISECTION 
problem. Next, if we replace the requirement of maximization with that of minimization 
then we obtain the (fc, n - k) MIN-PARTITION and MIN-BISECTION problems for P, 
respectively. 

The aforementioned optimal partition problems can be generalized to include a multiset 
of points in an arbitrary metric (in particular, geometric) space by just replacing the weight 
of a shortest path connecting v with w with the distance between v and w. 
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Figure 1: The cut values of the first partition (a) and the second partition (b) are respectively 
8 and 6. It follows that single geometric cuts are not always sufficient to generate minimum 
bisections on the real line. 



Given a graph G = (V, £■), a tree decomposition is a pair (X, T), where X = {Xi, . . . ,Xi} 
is a family of subsets (called bags) of V, and T is a tree whose nodes are the bags Xi, satisfying 
the following properties [21 section 12.3]: 

1. The union of all bags Xi equals V. 

2. For each {v,w) G E, there is a bag Xi that contains both v and w. 

3. If Xi and Xj both contain a vertex v, then all nodes Xk of the tree in the (unique) 
path between Xi and Xj contain v as well. 

The width of a tree decomposition is the size of its largest set Xi minus one. The treewidth 
tw{G) of a graph G is the minimum width among all possible tree decompositions of G. 
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3 The Algorithm 



Let T be a tree with at most n vertices and nonnegative real edge weights. Next, let P be 
a multiset of vertices of T whose cardinahty does not exceed n. 

Let us root T at some vertex r. Next, for each vertex v of T, let stands for the subtree 
of T induced by all vertices of T from which the path to r passes through v. We shall assume 
that is rooted at v. Finally, let be the set of all elements in P that are copies of vertices 
in Ty, and let P{v) stand for the set of all elements in P that are copies of the vertex v. 

Consider a subtree Ty of T and a partition of P into two multisets A and B. The crucial 
observation is as follows. 

In order to compute the total weight of shortest paths or their fragments contained within 
T.,,. it is sufficient to know the partition of P,, into Py H A and Py fl B, and just the number 
of elements in P \ Py that belong to A or 5. 

Indeed, the specific placement of the elements of {P\Py)nA oi {P\ Py) OB in the tree T 
is not relevant when we consider solely the maximal fragments of shortest paths connecting 
to these elements that are contained within Ty. 

The subproblem MAXCUT{v,p,q, s,t) consists in finding a partition of Py into two 
multisets A and B of cardinality p and q, respectively, that maximizes the sum of: 

1. the total weight of shortest paths between pairs of elements in Ax B; 

2. t times the total weight of the paths between pairs of elements in A x {v}; 

3. s times the total weight of the paths between pairs of elements in 5 x {v}. 

The parameters s and t are interpreted as the number of elements in P \ P„ that belong 
to the same set as those in A or S, respectively, in the sought two partition of P. 

We shall denote the maximum possible value of the sum by mc{v,p,q, s,t). Note that 
the total number of subproblems is O(n^). 

Assume first that T is binary. 

If Ty is the singleton {{v}, 0) then MAXCUT{v,p, q, s, t) can be trivially solved in 
time. Otherwise, we can reduce the subproblem to smaller ones by the recurrences given in 
the following lemmata. 

Lemma 1. Suppose that v has two children vi and v-j. The value mc{v,p,q,s,t) is equal to 
the maximum over partitions of\P{v)\ into the sum of natural numbers p{v), q{v), partitions 
of |Pt,J into the sum of natural numbers pi and qi, and partitions of jP^jl into the sum of 
natural numbers qi, q2, where p = p{v) + Pi+ P2 and q = q{v) + qi + q2, of the total of: 

1. mc{vi,pi, qi,s + p2+ p{v),t + q2 + q{v)); 

2. mc{v2,P2,q2,s + pi +p{v),t + qi + q{v)); 

3- {piq2 + P2qi){weight{vi,v) + weight{v2,v)); 

4- Pit X weight{vi,v) + p2t x weight{v2,v); 
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5. qis X weight{vi,v) + q2S x weight{v2,v) . 



Proof. The first and second component values correspond to the total weights of shortest 
paths or their fragments within T^- connecting pairs of elements in P^.xP^- or in P^^ x (P\P^.), 



i G {1, 2}, that belong to different sets in the sought two partition of P (see Fig. 2a). 

The three remaining ones correspond to the total weight of the edges {v,vi} and {v,V2} 
on shortest paths between copies of vertices in P belonging to different sets in the sought 
two partition. In particular, the third component value corresponds to the total weight of 
the {v,Vi} and {v,V2} fragments of shortest paths connecting pairs {a,b} of elements in P, 
where a is a copy of a vertex in T^^ while 6 is a copy of a vertex in T^^ a, b belong to 



different sets of the two partition (see Fig. 2b). 

Next, the fourth component value corresponds to the total weight of the {v,vi} and 
{^,^2} fragments of shortest paths connecting pairs {a,b} of elements in P, where a is a 
copy of a vertex in T^^ or T^^ belonging to the first set in the sought partition, while 6 is a 
copy of a vertex in P \ P„ belonging to the second set in the partition. The fifth component 
value can be specified symmetrically by switching the first set with the second set in the two 
partition. For pit x weight{vi,v) term see Fig. 2c, the other terms are symmetrical. □ 










(b) 



Figure 2: Distinct cases in the proof of Lemma 1 



Lemma 2. Suppose that v has a single child vi. The value mc{v,p,q,s,t) is equal to the 
maximum over partitions of\P{v)\ into the sum of natural numbers p{v), q{v), of the total 
of- 

1. mc{vi,p - p{v), q - g(f), s + p{v),t + q{v)); 

2. {p — p{v))t X weight{vi,v); 
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3- {q — q{v))s X weight{vi,v) . 



Proof. The first component value corresponds to the total weight of shortest paths or their 
fragments within T^. connecting pairs of elements in x P^^ or in Py^ x (P \ P^^) that 
belong to different sets in the sought two partition of P (see Fig. 3a). The second component 
value corresponds to the total weight of the {v,vi} fragments of shortest paths connecting 
pairs {a, b} of elements in P, where a is a copy of a vertex in T^,^ belonging to the first set in 
the sought partition, while 6 is a copy of a vertex in P \ P„ belonging to the second set in the 
partition (see Fig. 3b). Finally, the third component value can be specified symmetrically 
by switching the first set with the second set in the two partition. □ 




(a) (b) 
Figure 3: Two cases in the proof of Lemma [2] 

In Lemma [l| after picking a partition of |P(f)| and \Pvi\, the partition of jP^jl is deter- 
mined by the constraints p = p{v) +pi +p2 and q = q{v) +qi + q2- It follows that the number 
of candidates for partition sequences in Lemma [l] does not exceed O(n^) while that number 
in Lemma [2] is only 0(n). Furthermore, if the input multiset P is a set, i.e., |P(f)| < 1 for 
all vertices v, then the number of candidate partition sequences Lemmata [T| [2] is 0{n) and 
2, respectively. We conclude that we can compute mc{v,p,q,s,t) for all the corresponding 
subproblems MAXCUT{v,p,q,s,t) in a bottom up fashion in order of nondecreasing \Ty\ 
in 0{n^) time in the general case and in 0{n^) time if P is a set. 

Now, it is sufficient to find the maximum among the values of the form mc(r, p, g, 0, 0) 
(recall that r is the root of T), to obtain the maximum value of a cut for P. If we additionally 
require p = q then we obtain a maximum value of bisection etc. By backtracking, we can 
construct maximum cut, maximum bisection etc. The minimum variants of the partition 
problems can be solved analogously. 
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If the rooted T is not binary, we can easily transform it to a binary tree T' by adding 
totally at most n — 2 dummy vertices between parents with more than two children and their 
respective children. The dummy vertices between a parent v and its children form a binary 
subtree rooted at v whose leaves are the children. We set the weight of the edges incident 
to the children to the corresponding weights of the edges between v and their respective 
children in T. All other edge weights are set to zero in the subtree. The considered optimal 
cut and partitions problems for P in T are equivalent to those for P in T'. 

Theorem 1. Let T be a tree with at most n vertices and nonnegative real edge weights, 
and let P be a multiset of vertices in T with cardinality at most n. The MAX-CUT, 
MAX-BISECTION, {k, n-k) MAX-PARTITION, MIN-BISECTION and {k, n - k) MIN- 
PARTITION problems for P can be solved by dynamic programming in 0{n^) time. If P is 
a set then these problems can be solved in 0{n'^) time. 

The geometric optimal partition problems for a finite mulitset of points on the real line 
can be equivalently formulated as the corresponding partition problems in the shortest-path 
metric induced by the line graph whose vertices correspond to the points in the multiset 
and whose edges correspond to the minimal intervals between different points. The edges 
are weighted with the length of intervals. Since in the case of a line graph, the reductions 
of a subproblem to smaller ones rely solely on Lemma [2| the asymptotic time complexity 
decreases by n. 

Corollary 1. The geometric MAX- CUT problem on the real line as well as the geometric 
MAX-BISECTION, {k, n - k) MAX-PARTITION, MIN-BISECTION and (fc, n - k) MIN- 
PARTITION problems on the real line are solvable in O(n^) time. If the input multiset of 
points is a set then these problems are solvable in in 0{n?) time. 

4 Extensions to Graphs of Bounded Treewidth for 
Minimum Partition Problems 

A natural way of generalizing our method for tree metrics to those induced by graphs of 
bounded treewidth is as follows. Construct a tree decomposition T of the input graph G. 
For a bag h in the decomposition, let T}, be the subtree of T rooted at 6, and let stand for 
the subgraphs G^ of G induced by vertices contained in the bags of T;,. One could consider 
analogously subproblems with G^, the number of vertices in the first and the second set of 
sought partition within Gh and G \ Gb, respectively, as the subproblem parameters. 

Unfortunately, the aforementioned parameters do not seem sufficient to specify such a 
generalized subproblem. Simply, if b is not a singleton (as this is in the case of trees) then 
it is not clear which vertex in 6 on a shortest path connecting a vertex in Gb with a vertex 
in G\Gb belonging to a different set in the sought partition, would be the last one on the 
path within Gb- 

One could try to tackle this problem by introducing 2\b\ new parameters specifying for 
each vertex in b the number of shortest paths connecting vertices in Gb with those in G\Gb 
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belonging to a different set in the sought partition, where v is the last vertex in Gi, on the 
shortest paths before entering G \ Gb- (A single shortest path for each such pair would be 
counted.) Then, one could generalize the recurrences given in Lemmata [T| |2] but just in case 
of the minimum partition problems. 

The reason is that taking minimum would naturally force shortest path connections in 
the solutions to the subproblems. On the contrary, taking maximum would rather force 
longest path connections in these solutions. Thus, in the latter case, the generalized method 
would not yield a correct solution to the posed maximum partition problem. 

Let us take a closer look at the generalized recurrences for minimum partition problems. 
We may assume w.l.o.g that T is binary p]. Let bi and 62 be the child bags of the bag b in the 
tree decomposition T of G. For each of the aforementioned 2\b\ parameters of a subproblem 
associated with Gb, we need to consider possible partitions into at most |&i | + 162 1 components 
corresponding to the analogous parameters for the subproblems associated with Gb^ and Gb2 , 
respectively. Thus, if the width of the tree decomposition T is d, such a generalized recurrence 
would involve (77,<^('^))'^('^) = n^i^?) optimal solutions to smaller subproblems. It follows that 
the total time taken by the generalization to include minimum partitions in metric spaces 
induced by shortest paths in G would be 'nP^'^^\ The cost of the preprocessing, i.e., the 
construction of the decomposition tree of G of minimal width is marginal here [2] . 

Theorem 2. Let G he a graph of treewidth d with at most n vertices and nonnegative real 
edge weights, and let P he a multiset of vertices in G with cardinality at most n. The MIN- 
BISECTION and {k, n — k) MIN-PARTITION problems for P can be solved by dynamic 
programming in rP^'^^^ time. 

5 Final Remarks 

Our dynamic programming method for optimal partitions in tree metrics yields in par- 
ticular polynomial time solutions to these problems on the real line. It remains an open 
problem whether optimal partitions in geometric metrics of dimension greater than 1 admit 
polynomial time methods or they turn out to be inherently hard. At stake is the exact 
computational status of other geometric problems for which our knowledge is very limited at 
the moment. The exact computation complexity status of the maximum partition problems 
for shortest-path metrics in bounded treewidth graphs is also a challenging open problem. 
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